智能论文笔记

Statistical model for describing heart rate variability in normal rhythm and atrial fibrillation

Nikita Markov , Ilya Kotov , Konstantin Ushenin , Yakov Bozhko

分类：机器学习

2022-07-17

心率变异性（HRV）指数描述了心电图（ECG）中脱节间隔的特性。通常，HRV仅在不包括任何形式的阵发性节奏（NSR）的正常节奏（NSR）中测量。心房颤动（AF）是人口中最广泛的心律不齐。通常，这种异常节奏不会被分析，也没有被认为是混乱且不可预测的。但是，患有AF的患者的HRV指数范围有所不同，但影响了它们的生理特征却差不多。在这项研究中，我们提出了一个统计模型，该模型描述了NSR和AF中HRV指数之间的关系。该模型基于Mahalanobis距离，K-Neartible邻居方法和多元正态分布框架。使用从长期的Holter ECG中提取的NSR和AF的10分钟间隔进行了该方法的验证。为了进行验证，我们在K折过程中使用了Bhattacharyya距离和Kolmogorov-Smirnov 2样品测试。该模型能够以高精度预测至少7个HRV指数。

translated by 谷歌翻译

FedPop: A Bayesian Approach for Personalised Federated Learning

Nikita Kotelevskii , Maxime Vono , Eric Moulines , Alain Durmus

分类：机器学习 | (统计)机器学习

2022-06-07

个性化联合学习（FL）旨在协作学习每个客户的机器学习模型。尽管已经朝这个方向取得了希望，但大多数现有方法的工作都不允许进行不确定性量化，这在许多应用中至关重要。此外，跨设备设置中的个性化仍然涉及重要问题，尤其是对于新客户或少量观察的客户。本文旨在填补这些空白。为此，我们提出了一种新颖的方法，通过将个性化的FL重新铸造到人群建模范式中，其中客户的模型涉及固定的共同种群参数和随机效应，旨在解释数据异质性。为了获得我们的方案的融合保证，我们引入了一类新的联合随机优化算法，该算法依赖于马尔可夫链蒙特卡洛方法。与现有的个性化FL方法相比，所提出的方法具有重要的好处：对客户漂移是可靠的，对于推断新客户，最重要的是，在轻度的计算和内存开销中可以不确定性量化。我们为拟议算法提供非质合会收敛的保证，并说明了它们在各种个性化联合学习任务上的表现。

translated by 谷歌翻译

Classifiers are Better Experts for Controllable Text Generation

Askhat Sitdikov , Nikita Balagansky , Daniil Gavrilov , Alexander Markov

分类：自然语言处理 | 机器学习

2022-05-15

本文提出了一种简单的方法，用于使用自由形式分类器（即CAIF采样）基于加权逻辑来控制文本生成。使用任意文本分类器，我们将语言模型逻辑的一小部分调整为指导文本生成，以远离分类器预测。我们试验了避免毒性和情感控制任务，并表明该方法在PPL和DESS准确度指标上基于生成的文本的外部分类器而显着优于最近的PPLM，GEDI和DEXPERTS。此外，与其他方法相比，它更容易实施和调整，并且限制和要求较少。

translated by 谷歌翻译

Unsupervised construction of representations for oil wells via Transformers

Alina Rogulina , Nikita Baramiia , Valerii Kornilov , Sergey Petrakov , Alexey Zaytsev

分类：机器学习

2022-12-29

Determining and predicting reservoir formation properties for newly drilled wells represents a significant challenge. One of the variations of these properties evaluation is well-interval similarity. Many methodologies for similarity learning exist: from rule-based approaches to deep neural networks. Recently, articles adopted, e.g. recurrent neural networks to build a similarity model as we deal with sequential data. Such an approach suffers from short-term memory, as it pays more attention to the end of a sequence. Neural network with Transformer architecture instead cast their attention over all sequences to make a decision. To make them more efficient in terms of computational time, we introduce a limited attention mechanism similar to Informer and Performer architectures. We conduct experiments on open datasets with more than 20 wells making our experiments reliable and suitable for industrial usage. The best results were obtained with our adaptation of the Informer variant of Transformer with ROC AUC 0.982. It outperforms classical approaches with ROC AUC 0.824, Recurrent neural networks with ROC AUC 0.934 and straightforward usage of Transformers with ROC AUC 0.961.

translated by 谷歌翻译

Robustifying Markowitz

Wolfgang Karl Härdle , Yegor Klochkov , Alla Petukhina , Nikita Zhivotovskiy

分类：机器学习

2022-12-28

Markowitz mean-variance portfolios with sample mean and covariance as input parameters feature numerous issues in practice. They perform poorly out of sample due to estimation error, they experience extreme weights together with high sensitivity to change in input parameters. The heavy-tail characteristics of financial time series are in fact the cause for these erratic fluctuations of weights that consequently create substantial transaction costs. In robustifying the weights we present a toolbox for stabilizing costs and weights for global minimum Markowitz portfolios. Utilizing a projected gradient descent (PGD) technique, we avoid the estimation and inversion of the covariance operator as a whole and concentrate on robust estimation of the gradient descent increment. Using modern tools of robust statistics we construct a computationally efficient estimator with almost Gaussian properties based on median-of-means uniformly over weights. This robustified Markowitz approach is confirmed by empirical studies on equity markets. We demonstrate that robustified portfolios reach the lowest turnover compared to shrinkage-based and constrained portfolios while preserving or slightly improving out-of-sample performance.

translated by 谷歌翻译

Investigation and rectification of NIDS datasets and standratized feature set derivation for network attack detection with graph neural networks

Anton Raskovalov , Nikita Gabdullin , Vasily Dolmatov

分类：机器学习

2022-12-26

Network Intrusion and Detection Systems (NIDS) are essential for malicious traffic and cyberattack detection in modern networks. Artificial intelligence-based NIDS are powerful tools that can learn complex data correlations for accurate attack prediction. Graph Neural Networks (GNNs) provide an opportunity to analyze network topology along with flow features which makes them particularly suitable for NIDS applications. However, successful application of such tool requires large amounts of carefully collected and labeled data for training and testing. In this paper we inspect different versions of ToN-IoT dataset and point out inconsistencies in some versions. We filter the full version of ToN-IoT and present a new version labeled ToN-IoT-R. To ensure generalization we propose a new standardized and compact set of flow features which are derived solely from NetFlowv5-compatible data. We separate numeric data and flags into different categories and propose a new dataset-agnostic normalization approach for numeric features. This allows us to preserve meaning of flow flags and we propose to conduct targeted analysis based on, for instance, network protocols. For flow classification we use E-GraphSage algorithm with modified node initialization technique that allows us to add node degree to node features. We achieve high classification accuracy on ToN-IoT-R and compare it with previously published results for ToN-IoT, NF-ToN-IoT, and NF-ToN-IoT-v2. We highlight the importance of careful data collection and labeling and appropriate data preprocessing choice and conclude that the proposed set of features is more applicable for real NIDS due to being less demanding to traffic monitoring equipment while preserving high flow classification accuracy.

translated by 谷歌翻译

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Yuting Guo , Swati Rajwal , Sahithi Lakamana , Chia-Chun Chiang , Paul C. Menell , Adnan H. Shahid , Yi-Chieh Chen , Nikita Chhabra , Wan-Ju Chao , Chieh-Ju Chao

分类：自然语言处理

2022-12-23

Migraine is a high-prevalence and disabling neurological disorder. However, information migraine management in real-world settings could be limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text classification system for automatically detecting self-reported migraine-related posts, and (iii) conduct analyses of the self-reported posts to assess the utility of social media for studying this problem. We manually annotated 5750 Twitter posts and 302 Reddit posts. Our system achieved an F1 score of 0.90 on Twitter and 0.93 on Reddit. Analysis of information posted by our 'migraine cohort' revealed the presence of a plethora of relevant information about migraine therapies and patient sentiments associated with them. Our study forms the foundation for conducting an in-depth analysis of migraine-related information using social media data.

translated by 谷歌翻译

Zero-shot Triplet Extraction by Template Infilling

Bosung Kim , Hayate Iso , Nikita Bhutani , Estevam Hruschka , Ndapa Nakashole

分类：自然语言处理

2022-12-21

Triplet extraction aims to extract entities and their corresponding relations in unstructured text. Most existing methods train an extraction model on high-quality training data, and hence are incapable of extracting relations that were not observed during training. Generalizing the model to unseen relations typically requires fine-tuning on synthetic training data which is often noisy and unreliable. In this paper, we argue that reducing triplet extraction to a template filling task over a pre-trained language model can equip the model with zero-shot learning capabilities and enable it to leverage the implicit knowledge in the language model. Embodying these ideas, we propose a novel framework, ZETT (ZEro-shot Triplet extraction by Template infilling), that is based on end-to-end generative transformers. Our experiments show that without any data augmentation or pipeline systems, ZETT can outperform previous state-of-the-art models with 25% less parameters. We further show that ZETT is more robust in detecting entities and can be incorporated with automatically generated templates for relations.

translated by 谷歌翻译

MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue

Nikita Moghe , Evgeniia Razumovskaia , Liane Guillou , Ivan Vulić , Anna Korhonen , Alexandra Birch

分类：自然语言处理

2022-12-20

Task-oriented dialogue (TOD) systems have been applied in a range of domains to support human users to achieve specific goals. Systems are typically constructed for a single domain or language and do not generalise well beyond this. Their extension to other languages in particular is restricted by the lack of available training data for many of the world's languages. To support work on Natural Language Understanding (NLU) in TOD across multiple languages and domains simultaneously, we constructed MULTI3NLU++, a multilingual, multi-intent, multi-domain dataset. MULTI3NLU++ extends the English-only NLU++ dataset to include manual translations into a range of high, medium and low resource languages (Spanish, Marathi, Turkish and Amharic), in two domains (banking and hotels). MULTI3NLU++ inherits the multi-intent property of NLU++, where an utterance may be labelled with multiple intents, providing a more realistic representation of a user's goals and aligning with the more complex tasks that commercial systems aim to model. We use MULTI3NLU++ to benchmark state-of-the-art multilingual language models as well as Machine Translation and Question Answering systems for the NLU task of intent detection for TOD systems in the multilingual setting. The results demonstrate the challenging nature of the dataset, particularly in the low-resource language setting.

translated by 谷歌翻译

Extrinsic Evaluation of Machine Translation Metrics

Nikita Moghe , Tom Sherborne , Mark Steedman , Alexandra Birch

分类：自然语言处理 | 人工智能

2022-12-20

Automatic machine translation (MT) metrics are widely used to distinguish the translation qualities of machine translation systems across relatively large test sets (system-level evaluation). However, it is unclear if automatic metrics are reliable at distinguishing good translations from bad translations at the sentence level (segment-level evaluation). In this paper, we investigate how useful MT metrics are at detecting the success of a machine translation component when placed in a larger platform with a downstream task. We evaluate the segment-level performance of the most widely used MT metrics (chrF, COMET, BERTScore, etc.) on three downstream cross-lingual tasks (dialogue state tracking, question answering, and semantic parsing). For each task, we only have access to a monolingual task-specific model. We calculate the correlation between the metric's ability to predict a good/bad translation with the success/failure on the final task for the Translate-Test setup. Our experiments demonstrate that all metrics exhibit negligible correlation with the extrinsic evaluation of the downstream outcomes. We also find that the scores provided by neural metrics are not interpretable mostly because of undefined ranges. Our analysis suggests that future MT metrics be designed to produce error labels rather than scores to facilitate extrinsic evaluation.

translated by 谷歌翻译